High-Dimensional Access Methods for Efficient Similarity Queries
Abstract
Retrieving similar complex documents, such as images, sounds, or DNA sequences, from within a large collection is an issue of major importance. While content modeling and retrieval algorithms tend to perform more and more efficiently, the methods used to access the documents through their abstraction as high-dimensional feature vectors still perform poorly. In this report, we detail the different access methods that have been proposed to perform similarity queries on multi-dimensional feature spaces, present the reasons for their inefficiency in high-dimensional feature spaces, and finally review the attempts to solve these issues.
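For orientation, a similarity query over feature vectors can be sketched as a plain sequential scan that compares the query against every stored vector. This is only an illustrative baseline, not a method taken from the report: the function names `euclidean` and `knn_scan`, the choice of Euclidean distance, and the toy data are all assumptions.

```python
import heapq
import math
from typing import List, Sequence, Tuple


def euclidean(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_scan(
    query: Sequence[float],
    vectors: Sequence[Sequence[float]],
    k: int,
) -> List[Tuple[float, int]]:
    """Return the k nearest (distance, index) pairs, closest first.

    Every vector is visited once, so the cost grows linearly with the
    collection size and with the number of dimensions.
    """
    scored = ((euclidean(query, v), i) for i, v in enumerate(vectors))
    return heapq.nsmallest(k, scored)


# Toy collection of 2-dimensional feature vectors (hypothetical data).
collection = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0], [0.5, 0.0]]
nearest = knn_scan([0.0, 0.0], collection, k=2)
print(nearest)
```

A scan like this is the usual yardstick against which an access method is judged: an index is only worthwhile if it answers the same query while examining substantially fewer vectors.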
Similar resources
Metric Techniques for High-Dimensional Indexing
Despite the proposal of numerous tree-based structures for high-dimensional similarity searches, techniques based on a sequential scan, such as the VA-File, have been shown to be quite effective. In this thesis we present three new access structures which use sequential access patterns to efficiently answer similarity queries for high-dimensional vector and metric data. Two of these access struc...
HDKV: supporting efficient high-dimensional similarity search in key-value stores
Key-value stores are widely used for large-scale data management in cloud environments. However, they naturally support only key-based queries and have no efficient solutions for value-based queries. Thus, dealing with high-dimensional data in key-value stores is still a big challenge. State-of-the-art solutions apply value-based tree-structure indexes to solve this issue. These meth...
Query Language for Complex Similarity Queries
For complex data types such as multimedia, traditional data management methods are not suitable. Instead of attribute-matching approaches, access methods based on object similarity are becoming popular. Recently, this has resulted in intensive research on indexing and searching methods for similarity-based retrieval. Nowadays, many efficient methods are already available, but using them to b...
A Method for Protecting Access Pattern in Outsourced Data
Protecting the information access pattern, which means preventing the disclosure of data and structural details of databases, is very important in working with data, especially in the cases of outsourced databases and databases with Internet access. The protection of the information access pattern indicates that mere data confidentiality is not sufficient and the privacy of queries and accesses...
Bitmap Indices for Speeding Up High-Dimensional Data Analysis
Bitmap indices have gained wide acceptance in data warehouse applications and are an efficient access method for querying large amounts of read-only data. The main trend in bitmap index research focuses on typical business applications based on discrete attribute values. However, scientific data that is mostly characterised by non-discrete attributes cannot be queried efficiently by currently s...